NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The SCALES Project: Making Federal Court Records Free

https://doi.org/10.2139/ssrn.4948027

Schwartz, David L; Albrecht, Kat; Pah, Adam; Cotropia, Christopher Anthony; Sanders, Amy Kristin; Sanga, Sarath; Alexander, Charlotte; Amaral, Luis_A N; Clopton, Zachary D; Tucker, Anne M; et al (September 2024, Northwestern University Law Review)

Federal court records have been available online for nearly a quarter century, yet they remain frustratingly inaccessible to the public. This is due to two primary barriers: (1) the federal government's prohibitively high fees to access the records at scale and (2) the unwieldy state of the records themselves, which are mostly text documents scattered across numerous systems. Official datasets produced by the judiciary, as well as third-party data collection efforts, are incomplete, inaccurate, and similarly inaccessible to the public. The result is a de facto data blackout that leaves an entire branch of the federal government shielded from empirical scrutiny. In this Essay, we introduce the SCALES project: a new data-gathering and data-organizing initiative to right this wrong. SCALES is an online platform that we built to assemble federal court records, systematically organize them and extract key information, and-most importantly-make them freely available to the public. The database currently covers all federal cases initiated in 2016 and 2017, and we intend to expand this coverage to all years. This Essay explains the shortcomings of existing systems (such as the federal government's PACER platform), how we built SCALES to overcome these inadequacies, and how anyone can use SCALES to empirically analyze the operations of the federal courts. We offer a series of exploratory findings to showcase the depth and breadth of the SCALES platform. Our goal is for SCALES to serve as a public resource where practitioners, policymakers, and scholars can conduct empirical legal research and improve the operations of the federal courts. For more information, visit www.scales-okn.org.
more » « less
Full Text Available
A user-centered approach to developing an AI system analyzing U.S. federal court data

https://doi.org/10.1007/s10506-022-09320-z

Adler, Rachel F.; Paley, Andrew; Li Zhao, Andong L.; Pack, Harper; Servantez, Sergio; Pah, Adam R.; Hammond, Kristian; Consortium, SCALES OKN (August 2022, Artificial Intelligence and Law)

Abstract We implemented a user-centered approach to the design of an artificial intelligence (AI) system that provides users with access to information about the workings of the United States federal court system regardless of their technical background. Presently, most of the records associated with the federal judiciary are provided through a federal system that does not support exploration aimed at discovering systematic patterns about court activities. In addition, many users lack the data analytical skills necessary to conduct their own analyses and convert data into information. We conducted interviews, observations, and surveys to uncover the needs of our users and discuss the development of an intuitive platform informed from these needs that makes it possible for legal scholars, lawyers, and journalists to discover answers to more advanced questions about the federal court system. We report on results from usability testing and discuss design implications for AI and law practitioners and researchers.
more » « less
Full Text Available
The Promise of AI in an Open Justice System

https://doi.org/10.1002/aaai.12039

Pah, Adam R; Schwartz, David L; Sanga, Sarath; Alexander, Charlotte S; Hammond, Kristian J; Amaral, Luís A.N. (March 2022, AI Magazine)

Full Text Available
PRESIDE: A Judge Entity Recognition and Disambiguation Model for US District Court Records

https://doi.org/10.1109/BigData52589.2021.9671351

Pah, Adam R.; Rozolis, Christian J.; Schwartz, David L.; Alexander, Charlotte S.; Okn Consortium, Scales (December 2021, 2021 IEEE International Conference on Big Data (Big Data))

The docket sheet of a court case contains a wealth of information about the progression of a case, the parties’ and judge’s decision-making along the way, and the case’s ultimate outcome that can be used in analytical applications. However, the unstructured text of the docket sheet and the terse and variable phrasing of docket entries require the development of new models to identify key entities to enable analysis at a systematic level. We developed a judge entity recognition language model and disambiguation pipeline for US District Court records. Our model can robustly identify mentions of judicial entities in free text (~99% F-1 Score) and outperforms general state-of-the-art language models by 13%. Our disambiguation pipeline is able to robustly identify both appointed and non-appointed judicial actors and correctly infer the type of appointment (~99% precision). Lastly, we show with a case study on in forma pauperis decision-making that there is substantial error (~30%) attributing decision outcomes to judicial actors if the free text of the docket is not used to make the identification and attribution.
more » « less
Full Text Available
From data to information: automating data science to explore the U.S. court system

https://doi.org/10.1145/3462757.3466100

Paley, Andrew; Zhao, Andong L.; Pack, Harper; Servantez, Sergio; Adler, Rachel F.; Sterbentz, Marko; Pah, Adam; Schwartz, David; Barrie, Cameron; Einarsson, Alexander; et al (June 2021, ICAIL '21: Proceedings of the Eighteenth International Conference on Artificial Intelligence and Law)
null (Ed.)
The U.S. court system is the nation's arbiter of justice, tasked with the responsibility of ensuring equal protection under the law. But hurdles to information access obscure the inner workings of the system, preventing stakeholders - from legal scholars to journalists and members of the public - from understanding the state of justice in America at scale. There is an ongoing data access argument here: U.S. court records are public data and should be freely available. But open data arguments represent a half-measure; what we really need is open information. This distinction marks the difference between downloading a zip file containing a quarter-million case dockets and getting the real-time answer to a question like "Are pro se parties more or less likely to receive fee waivers?" To help bridge that gap, we introduce a novel platform and user experience that provides users with the tools necessary to explore data and drive analysis via natural language statements. Our approach leverages an ontology configuration that adds domain-relevant data semantics to database schemas to provide support for user guidance and for search and analysis without user-entered code or SQL. The system is embodied in a "natural-language notebook" user experience, and we apply this approach to the space of case docket data from the U.S. federal court system. Additionally, we provide detail on the collection, ingestion and processing of the dockets themselves, including early experiments in the use of language modeling for docket entry classification with an initial focus on motions.
more » « less
Full Text Available
How to build a more open justice system

https://doi.org/10.1126/science.aba6914

Pah, Adam R.; Schwartz, David L.; Sanga, Sarath; Clopton, Zachary D.; DiCola, Peter; Mersey, Rachel Davis; Alexander, Charlotte S.; Hammond, Kristian J.; Amaral, Luís A. (July 2020, Science)

Full Text Available

Search for: All records